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Amendments to the Specification 

Please repla« tbe paragraph on Page 1 , Unes 5*10 with the following tnarked-up replacement 
paragraph: 

- The present invention is related to U. S- Patent . titled "Array-Based 
Extensible Document Storage Format*^ (serial number 09? ^ ) . r efer r ed 09/652.2961. 

referred to herein as tbe ''first related invention", and U. S. Patent , titled 

"High-Pcrformance Extensible Document Transformation" (serial number 0 9 / filed 
09/653,080^. filed concurrently herewith. These related inventions are commonly assigned to 
International Business Machines Corporation (IBM), and are hereby incorporated herein by 
reference* — 

Please replace the paragraph on Page 2, lines 8-18 with the following marked-i^ replacement 
paragraph: 

- Business and consumer xise of distributed computing, also commonly referred to as 
network conqmting, has gained tremendous popularity in recent years. In this computing model, 
the data and/or programs to be used to perform a particular computing task typical^ reside on 
(ie. are "distributed" among) mote than one con^uter» whwe these mutttple computers are 
connected by a network of some type. The Internet, and the part of the Internet known as the 
World Wide Web (he r eina ft er, "Web^, Aie Web fhereinaften "Web"), are well-known exan^les of 
this type of environment wherein the multiple coirq)uters are connected using a public networic 
Other types of network environments in which distributed computing may be used include 
intranets, which are typically private networks accessible to a restricted set of users (such as 
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employees of a corporation), and extratiets (e.g., a corporate network which is accessible to other 
users than just the employees of the conopa^y which owns and/or manages the network, swch as 
the con9}any*s business partners). — 

Please replace the paragraph that begins on Page 3, line 13 and carries over to Page 4, line 5 with 
the foltowing nwked-up replacement paragraph: 

— The syntax of XML is extensible and flexible, and albws document developers to 
create tags to convey an explicit nested tree document stnictme (where the structure is 
determined from the relationship amoi^ the tags in a particular document). Furthermore, 
document developers can define their own tags which may have application-specijEk; semantics. 
Because of this cKtensibility, XML documents xaay be used to specify many diflferent types of 
information, for use in a virtually unlimited number of contexts. It is this extemibilxty and 
flexibility which is, in large part, responsible for the popularity of XML. (A number of XML 
derivative notations have been defibied, and continue to be defined, for particular purposes. 
-'VoiceXML" is an example of one such derivative. References herein to **XML'* are intended to 
inchide XML derivatives and semantically similar notations such as derivatives of the Standard 
Generalized Markup Language, or "SGML'*, from which XML was derived. Refer to ISO 8879, 
"Standard Generalfeed Markup Language (SGML)*', (1 986) for more information on SGML. 
Refer to ''Extensible Markup Language (XML), W3C Recommendation lO-Februaiy-1998" 
which is available frotn the World Wide Web Consortium, or ''WBC". for o n the World Wide Web 
ath t tp://www,w3:o r g/TR/1 9 9m:C '^ A u J-^ fui u iore information on XML,) 
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Please replac tihe paragraph on Page 18> lines 2-13 with the following marked-tip replacement 
paragraph: 

- Fig. 4A iUtistrates a simple structured document 400 vMch is represented in the . 
existing XML notation. This document contains 6 elements v^rich are organized in a 3-level 
hierarchy. The node having element name **root_eIement" 402 is the root node» being at the 
hi^iest level of the hierarchy. This node has 2 cWld nodes, having element names 
"level_one_eIementl" 410 and "Ievel_one_eIement2" 420. Node "level_ot>e_elementl" 410 also 
has 2 child nodes, which are the nodes having element names '"level^two^elementl 1" 412 and 
"level_two_eIementl2" 414, and n o de ^c ' vcl_tw o _el cm cn t 2" 420 node 'level^onc elcmcnt2" 420 
has a single child node having element name **level_two_e)ement2r' 422, A tree structure 430 
representing document 400 is shown in Fig. 4B, where the tags for the 6 elements are depicted 
inside rectangular shapes representing nodes of the tree and the data content corresponding to 
each node is shown inside an ellipse. This interpretation of an XML document 400 and its 
corresponding tree structure 430 are well known in the art. - 

Please replace the paragraph on Page 24, lines 1-10 with the following marked-up replacement 
paragraph: 

— The information for each attribute in the attribute list is also prejSerabiy delimited using 
a comma. Within each attribute's information, a period is preferably used as a delimiter. 
Referring to the example in Fig. 4A, node [[Bl]] 41fi has 2 attributes. The first has the attribute 
name "id" and the attribute value "r. Thus, the length of the attribute name is 2, and the length 
of the attribute vahie is 1 . Again using zero-based counting, the first attribute represented in the 
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attribute Jist fiar the node ^wn at 41 0 is therefore specified as **0.2,2. 1", meaning that the name 
of the attribute is found in the data buffer starting at position 0 for a length of 2 characteis, and 
the value is found starting at position 2 for a length of L As shown in Fig- 4C (see reference 
niffiti t>er480), the data bufer is preferably stored at the end of the mXML document. A parser 
can thi^fore avoid scanning th^ characters during the parsii^ process when they are not 
needed. — 

Please replace the paragraph on Page 24, lines 11 - 17 with the following maiked-upreplaceineriC 
paragraph: 

The second attribute of node 41Q of 01 attilbutL^ m this example hA<i the name 
"name" and the value "r*. The information for this second attribute is therefore specilied usmg 
the syntax "3A7.r\ meaning that the attribute's name is found in the data buffer starting at 
position 3 for a length of 4 characters and its vahie is found startiog at position 7 for a length of 1 , 
If a node has more than 2 attributes, this dot-delimited syntax is used for each such attribute, and 
is separated fix)m the other attribute specifications for this node using comnias as delimiters. As 
with the child Hst syntax, if a node has no attributes, the absence is indicated by specifying an 
enqpty attribute list. - 

Please replace the paragraph that begins on Page 24, line 19 and carries over to Page 25, line 5 
whh the following mariced-up replacement paragraph: 

" While the syntax used in the prefen^cmbodittient refers to the data 
starting positions and length vahies, as described for the attribute nanjes and values of the node at 
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412j in an alternative syntax the starting and ending positions within the data bufiTer may be used. 
Thus^ the specification for the first attribute of the node at 412 wotdd be expressed as 1 .2.2", 
meaning that the attribute name begins at position 0 and ends at position 1, and the attribute value 
begins and ends at position 2, Similarfy, the specification [[fro]] for the second attribute would be 
e?q>ressed as "3.6.7.7". Use of length values, as selected for the syntax of the preferred 
embodiment, will in general require sUghtly less space than use of ending positions. — 

Please replace the paragr^h on Page 25, lines 4-14 with the following marked-up replacement 
paragraph: 

— ThuSj it can be seen that the structure of an mXML document is explicitly specified 
within the document This information can be used to build a Document Object Model (*TX)M") 
tree, if desired. The DOM tree can then be processed as in the prior art. Alternatively, the 
mXML document notation can be traversed direct^, for example to tocate information about a 
particular node, to detemiine the overall structure of the document, or to otherwise operate upon 
the mXML document The mXML docimient may be stored using the array-based extensible 
document storage format described in the first related invention, resuhing in further processing 
efiSciencies (as described therein) when operating on a documexrt. (DOM is published as a 
Recommendation of the World Wide Web Consortium ("W3C"), titled 'TJocument Object Model 
(DOM) Level 1 Specification, Version 1 .0" (1 998 \ which is available from the W3C.> and 
avAilabL o n tlie Web at http://www,w3. o f6/TR/RCG-DOKf-Lct>U - l, ^T)ONr is j tradenKuk ty f 
Massachuse tt s Institu t e o f Tcchnotogy ) — 
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Please replace the paragraph that begins on Page 27, line 6 and carries over to Page 28, line 12 
with the following marked-up replacetnent paragraph: 

-- The XML notation includes a number of notational elements vrfiich are not strictly 
necessary for data-centered document specificatioa An XML subset referred to as "SML", for 
"Smq>le Markup Language", is current^ uxxler discussion in the technical community. This XML 
subset proposes use of a core set of XML syntax, and omission of features inchading attributes, 
processing instructions, etc. See, for example, a Web-published article entitled *'SML: 
Simplifying XML", which [[is]l 3sa§ written by Robert E, La Quey and published November 24, 
1 999, The is loca t ed at l Tt tp://www>xmltuAn/pul/l 999/1 1/smiyiudex.htn i J (published 1 1/24/99). 
^fbc preferred mXML syntax which is described herein provides support for one core set of XML 
notational elements (although not identical to the core set proposed for SML), where the basic 
node types inchide elements and attributes. More con9)licated XML documents containing 
additional node types can be supported by extending this preferred mXML syntax, wliere those 
additional node types inchide comments, processing instructions, CDATA, entity, entity 
reference, and document type nodes. In a preferred technique for specifying this extended mXML 
syntax, **text'* nodes are added to an mXML document to refer to the actual node content. A 
node specification for a node type such as those just Hsted preferably occurs in-line within the 
mXML document, in the same relative location where it appears in a corresponding XML 
document. This node specification preferably comprises a nuU value tn plara of the node name; a 
list pointir^ to one or nwre child nodes, as is used in fte node specifications which have been 
described, except that the children arc now text nodes; an empty attribute list; and a pair of special 
indicators as the node value specification. The starting position entry within the special indicator 
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pair is used to denote which type of other node is being represented. For exairple, a vahie of -2 
inay represent a comment^, w*ile a vahie of -3 represents a processing instruction, and so forth. 
The length entry within the special indicator pair is preferably set to -1 . The node specification 
for each of the child text nodes referenced from the special child list preferabiy also uses a null 
name, and a null child list and attribute list. The value entry in this child text node then (1) points 
to a location within the data buffer wfc^re the node's content is stored (preferably as a character 
string representing all the significant content from the source node), and (2) stores the length of 
this content. — 

Please replace the paragraph on Page 33, lines 7-17 with the feUotving THarked-up replacement 
paragraph: 

— Block 670 converts the node's attribute infonnation, if any, and writes this to the 
raXML buflfer, followed by a senri-cobn deUmiter. For each attribute of the current iwde that is 
located in the DOM tree, the attribute's nante and value are written to the data buffer in 
successive bcations. The position within the data buflfer where the mme begins, and its ler^th, 
arc written to the mXML buflfer as the first two dot-separated integers of the attribute 
specification. The data buffer counter that was initialized at Block [[610]] 620 is then 
incremented by the kmgth of the attrilHrte name. Similarly, the position within the data buflfer 
where the attribute vahie begins, and its length, are written to the mXML buflfer using the dot- 
separated notation (and after a dot that follows the attribute name's length), and the data buflfer 
counter is incren^nted by the length of the attribute value. If this node has more than one 
attribute, a comma is written to the mXML buflfer to delimit the dot-separated attribute 
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Specifications. — 
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